DOLPHIN - 2011 - Annual activity report

DOLPHIN

DOLPHIN - 2011

Project Team Dolphin

Members

Overall Objectives

Scientific Foundations

Application Domains

Software

ParadisEO

New Results

Contracts and Grants with Industry

Contracts with industry

Partnerships and Cooperations

Dissemination

Bibliography

Previous |

Home | Next next

Section: Application Domains

Application to Bioinformatics

Bioinformatic research is a great challenge for our society and numerous research entities of different specialities (biology, medical or information technology) are collaborating on specific thema.

Genomic and post-genomic studies

Previous studies of the DOLPHIN project mainly deal with genomic and postgenomic applications. These have been realized in collaboration with academic and industrial partners (IBL: Biology Institute of Lille; IPL: Pasteur Institute of Lille; IT-Omics firm).

First, genomic studies aim to analyze genetic factors which may explain multi-factorial diseases such as diabetes, obesity or cardiovascular diseases. The scientific goal was to formulate hypotheses describing associations that may have any influence on diseases under study.

Secondly, in the context of post-genomic, a very large amount of data are obtained thanks to advanced technologies and have to be analyzed. Hence, one of the goals of the project was to develop analysis methods in order to discover knowledge in data coming from biological experiments.

These problems can be modeled as classical datamining tasks (Association rules, feature selection). As the combinatoric of such problems is very high and the quality criteria not unique, we proposed to model these problems as multi-objective combinatorial optimization problems. Evolutionary approaches have been adopted in order to cope with large scale problems.

Nowadays the technology is still going fast and the number of data increases rapidly. Within the new collaboration, started in 2010, with Genes Diffusion, specializes in genetics and animal reproduction for bovine, swine, equine and rabbit species, we will study combinations of Single Nucleotide Polymorphisms (SNP) that can explain some phenotypic characteristics.

Docking and conformational sampling

In molecular modelling, conformational sampling and docking procedures provide help for understanding the interaction mechanisms between (macro)molecules involved in physiological processes. The processes to be simulated are of a combinatorial complexity (molecule size, number of degrees of freedom) which represents an important challenge for the currently available computing power. Such a challenge can be expressed by three major objectives: (1) the proposition of mathematical models of maximum simplicity that nevertheless provide a relevant description of molecular behavior, (2) the development of powerful distributed optimization algorithms (evolutionary algorithms, local search methods, hybrid algorithms) for sampling the molecular energy surface for stable, populated conformations, and (3) the deployment of those intrinsic distributed algorithms on computational Grids.

Within the framework of ANR DOCK and Decrypton projects, the focus was to propose multi-objective formulations of the conformational and docking problems. The goal was to take into account different criteria characteristics of the complex docking process. Furthermore, in order to deal with the multimodal nature of the problems it is important to define new hybrid mechanisms allowing us to provide algorithms with both diversification and intensification properties. Finally, to deal with the exponential combinatory of these problems when large proteins are concerned parallel and grid computing is highly required. Using grid computing is not straightforward, so a "gridification" process is necessary. Such process allows us to adapt the proposed algorithms to the characteristics of the grid. The gridification process must be exploited by the user in a transparent way. Therefore, coupling ParadisEO-PEO with a generic grid middleware such as Globus is important to provide robust and efficient algorithms to be exploited transparently.

New contacts with the Servier company show that these questions are really challenging ones for the design of new drug molecules.

Optimization for health care

The new collaboration (PhD thesis started in october 2010) with Alicante company, major actor in the hospital decision making, will deal with knowledge extraction by optimization methods for improving the process of inclusion in clinical trials. Indeed, conducting a clinical trial, allowing for example to measure the effectiveness of a treatment, involves selecting a set of patients likely to participate to this test. Currently existing selection processes are far from optimal, and many potential patients are not considered. The objective of this collaboration consists in helping the practitioner to quickly determine if a patient is interesting for a clinical trial or not. Exploring different data sources (from a hospital information system, patient data...), a set of decision rules have to be generated. For this, approaches from combinatorial optimization will be implemented, requiring extensive work to model the problem, to define criteria optimization and to design specific optimization methods.

Previous |

Home | Next next